Coarse-Grained Model of Entropic Allostery 
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Many signalling functions in molecular biology require proteins to bind to substrates such as 
DNA in response to environmental signals such as the simultaneous binding to a small molecule. 
Examples are repressor proteins which may transmit information via a conformational change in 
response to the ligand binding. An alternative entropic mechanism of "allostery" suggests that the 
inducer ligand changes the intramolecular vibrational entropy, not just the mean static structure. 
We present a quantitative, coarse-grained model of entropic allostery, which suggests design rules 
for internal cohesive potentials in proteins employing this effect. It also addresses the issue of how 
the signal information to bind or unbind is transmitted through the protein. The model may be 
applicable to a wide range of repressors and also to signalling in transmembrane proteins. 



Introduction. It is becoming increasingly clear that 
dynamics, as well as static structure, are important in 
molecular biology. For example simulations of dynamical 
transitions in proteins [l| suggest that collective global 
modes are correlated with protein function. This letter 
focuses on repressor proteins which bind to DNA to "turn 
off" genes when the cell does not require their expression. 
The binding is "allosteric" : it is activated depending on 
the presence of inducer ligands, small molecules which 
themselves bind to the protein. The "holorepressor" 
( "aporepressor" ) is the protein with (without) a bound 
ligand. Frequently the ligand binding site on the protein 
is distant from that of the DNA. For this reason ligand 
binding has often been assumed to cause a conforma- 
tional change within the repressor protein, decreasing its 
affinity for DNA in one state compared to the other state. 
However, this is evidence that dynamically induced en- 
tropic changes may contribute to allostery 

A classic example of a repressor system is the E-coli 
lac repressor [a Ifl S H • I n this case the aporepressor 
binds to DNA, suppressing the genes for the metabolism 
of lactose. A second example is the E-coli trp repres- 
sor 0,0,0 that, on binding, prevents the expression 
of the gene for tryptophan synthesis. But in contrast to 
the lac, trp-type holorepressor proteins bind to DNA and 
the aporepressors do not. There are many such repressor 
systems but the lac and the trp will act as representative 
cases for this Letter. Our challenge is to explore whether 
the Brownian fluctuations in protein structure may carry 
information between the two binding sites, thereby pro- 
ducing cooperative lac-type or trp-type behaviour. This 
mechanism of cooperativity is one of the key questions in 
understanding protein function 13]. 

A coarse-grained model. In a coarse-grained represen- 
tation we model a repressor protein dimer as two rigid 
plates of length I and width w, representing the two pro- 
tein monomers motivated by the common dimer motif 
(see Figure nj. We parameterise the relative motion of 
the plates by three relative translation displacements (x, 
y, z) and three rotation angles (9 X , 9 y , 9 Z ). The sta- 



bilising contact interaction between the two dimers is 
characterised by 5 quadratic (3D) potential wells. Fig- 
ureQ]shows 5 of the effective springs that arise, Af which 
are perpendicular to the plates. We find that a min- 
imal model requires just 4 other springs in the plane 
of the plates (summations of diagonal springs between 
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the plates), which we label as Aq , Aq_ 
resenting the resolved y and z relative displacements. 
These local interactions represent a minimal set of "sticky 
patches" which could arise from hydrophobic, side chain, 
or electrostatic forces. The corresponding spring con- 
stants Xi could be calculated in principle from the details 
of these interactions. We allow only local modifications 
of the contact interactions on binding. The springs local 
to the ligand binding site will be affected by the binding 
of an inducer or corepressor. (Lac actually binds two in- 
ducer ligands, which we simplify here with a single bound 




FIG. 1: Plates and springs model for the interaction of the 
two domains of a repressor dimer. The xray structure (PDB 
1EFA 0) °f the lac repressor dimer (DNA shown in green) is 
shown behind the model. 
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state.) Similarly springs local to the DNA binding site 
will be affected by binding to DNA. The other springs 
act as anchoring potentials. 

We consider vibrations of the plates in 3 different 
planes: (i) in the x, y plane (translational vibrations 
along the x-axis and rotations about the z-axis); (ii) in 
the x, z plane (translation along the x-axis and rotation 
about the y-axis); and (hi) in the y, z plane (translation 
along the z-axis and rotation about the x-axis) . 

The Tirion potential replaces the full MD potential 
with a simple pairwise quadratic potential of universal 
strength and this is found to be sufficient to describe 
the low frequency modes involving coherent motion of 
large groups of atoms. In our model we similarly look 
only at these low frequency modes and describe them by 
harmonic potentials between the protein domains, but 
allow the potentials between the protein monomers to 
acquire locally specific values. 

As an example of detailed calculations we take the 2x2 
system of motions in the y, z plane. We write z 01 = 
z ao — l -^f~ an d z o-i — z oo + 2 ( wnere ®x is the angle of 
rotation about the x-axis) to obtain the Hamiltonian in 
terms of the mutual translational vibration coordinate 
z 00 and mutual rotational vibration coordinate X ■ 



H = -pM _1 p+ -xKx 

where the interaction matrix for the y, z plane 
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and bound to unbound 
This gives us 

(4Aj + M- + f )(4A_! + %i + J*-)' 

A_i Ai Ai A_i 



(4A!A_ 1 + -^ + %i)(4- 

A- 1 Ai 



(4) 

The other modes can be modelled in the same way to 
give additional contributions to AAS. The 2-plate model 
generates a 3 x 3 form of M and K for coupled rotations 
about y and z and translations along x plus one simple 
y-translation. 

We take N = 2 for the lac since it is a tetramer of two 
dimers and similarly for the trp since it represses as a 
dimer of dimers so also has two dimers. 

AAS* > gives the trp case whereby the affinity for 
the holorepressor binding to DNA is greater than the apo 
repressor. AAS" < however gives the lac case since the 
apo lac repressor is the one with the higher affinity for 
DNA. Applying these inequalities to equation (@J gives 
the following rule determining which case arises. 
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trp 
lac 
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The trp case occurs when both spring constants increase 
(Ai > 1, A_i > 1) or decrease (Ai < 1, A_i < 1) on 
ligand binding. The lac case occurs when one spring 
constant increases and the other decreases (Ai > 1 and 
A_i < 1 or Aj < 1 and A_i > 1). Figure □ plots the 



the inertial matrix 
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where m is the reduced mass and I is the reduced moment 
of inertia of the dimer pair. This leads to the partition 
function in the relevant classical limit 



Z = 



"dx„. d9i = 



2irkT 



(|M-i||K|) 



1/2- 



(2) 



Finally from (J2J the entropy of the protein dimer for a 
single plane is 

S = Nk(ln2nkTVmI/l- 



-l/21n(AiA_i+Ao(Ai+A_i)/4)) 

(3) 

where A^ has been abbreviated to A,: for convenience. 
We are interested in the difference between the change 
in entropy on binding at the DNA binding site of the two 
cases in which the protein is, and is not also bound to the 
inducer. We call this AAS* = AShoio — A^apo- A result 
with AAS* ^ would signify cooperative behaviour i.e. 
the binding to DNA is affected by the binding to the 
inducer. We write AAS* in terms of dimensionless spring 




FIG. 2: Graph showing AAS 1 against Ai and A-i. The con- 
tour plot on the base shows regions where lac-type (magenta) 
and trp-type (green) behaviour is optimised. 

function AA5(Ai, A_i) (equation . Biologically rele- 
vant values for the original spring constants were chosen 
using protein £?-factor data (related to the rms positions 
of the atoms B = 8ir 2 < u 2 > 0|) and steered simula- 
tions (see later) giving the case for a potential which is 
stronger at the inducer binding site (Ai > A_i). It can 
be seen that the negative AAS lac-type effect is max- 
imised when the ligand binding at the inducer binding 
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site decreases the spring constant Ai but A_i increases 
on DNA binding. The trp-type effect (positive AAS) is 
maximised when both spring constants decrease as much 
as possible on binding. This reduction will however be 
limited by the physical requirement for overall stability 
of the complex. Requiring that the RMS displacement of 
the monomers be less than the average separation of the 
atoms leads to the estimation that Ai > Q.lkTkr 2 . 

Physically, "entropic allostery" allows the lac inducer 
binding to communicate via the large amplitudes of the 
internal modes of the protein to the "read-head" binding 
regions near the DNA which as a result move too much 
to be inserted into the DNA. 

Estimating the spring constants. We evaluate our 
model in the real example of the lac repressor as an il- 
lustration. Firstly we simply calculated estimates for the 
spring constants from the x-ray crystal S-factors of the 
solved structures in the protein data bank. From the 
solved structure 0, HI, it is clear that Ai decreases 
on inducer binding since the headpiece domains are dis- 
rupted on lac binding. On DNA binding specific inter- 
actions are formed between the repressor headpiece do- 
mains and the base pairs in the DNA major groove which 
would increase the local binding potential and so increase 
A_i. These observations therefore support our finding 
that the lac case has Ai < 1 and A_i > 1. 

We converted the protein data for the B-factors 0, 0, 
Il5| into RMS vibration values and estimated the spring 
constants A^ and A* (in this case averaged over the 
vibrations in the different planes) from the expression 
A ~ 1/ < u 2 > giving Ai » 1.2, A_i w 0.1 (estimated 
from 0), Ai « 0.07 and A_i w 6.7. We now calculate 
an estimate of A AS using equation Q . Including a fac- 
tor of three due to the three planes of vibration, we obtain 
for our plate-dimer model a value of TAAS ~ —lAkT. 
Since the experimental values for the change in binding 
energy between holo and aporepressor binding to DNA 
are AAG ~ 6kT [H H Eg this indicates that the en- 
tropic contribution is likely to be significant since the 
crystal dynamics can only be a lower bound for the am- 
plitudes of vibration. 

To improve upon this rough estimate we calculated the 
interaction energy between the lac monomers in a fully 
atomistic computation using the software "ens" |2j| and 
steering the relative positions of the two monomers. We 
used the xray crystal structure coordinates PDB ID 1LBI 
5] for the aporepressor and 1TLF for the holorepres- 
sor. By relative translation along the 3 axes, and rotation 
about axes at the extremities of the protein dimer, and 
recalculating the total interaction energy at each incre- 
ment, we were able to build up curves for the potential 
energy wells (see Figure |3J). By curve fitting the bottom 
of these wells to a quadratic (to fit with our harmonic ap- 
proximation) we were able to extract the curvature and 
therefore the spring constants for each global mode of 
both the holo and aporepressors. We then used these to 
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FIG. 3: Graph showing an example of a change in spring 
constant with and without the inducer for the lac repressor. 
The blue squares are ens data for 1LBI the aporepressor and 
the red stars are ens data for 1TLF the holorepressor. The 
lines are quadratic fits. 

calculate AS'i n duccrbind which is an estimate for —AAS 1 in 
the case of large A_i . For the motion in the x, y and x, z 
planes (translation along x and rotation about y and z) 

this gave TAAS 1.66fcT, for the y, z plane -0.42fcT 

and for the relative translation along y (which does not 
have rotation modes coupled to it) — 0.28fcT. Interest- 
ingly the softest mode, contributing most to the allostery, 
is the one which shifts the DNA read heads (which point 
in the plane perpendicular to the core) away from the 
DNA perpendicular to it (the x,z plane). Therefore in 
total we have an estimate for AAS* ~ — 2.36fcT. 
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FIG. 4: Graph showing the fraction of DNA sites bound by 
repressors against the concentration of inducer lactose. The 
solid curve includes the entropic component and the dashed 
curve is without the entropic component. 

To check how significant this entropy contribution is 
to the total AAG ~ 6kT we compared the probability of 
genes being repressed (bound by repressor) against lac- 
tose concentration with and without this entropic contri- 
bution (following Yildirim 22]) (see Figure BJ. For 95% 
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activation (operators not bound) 18 lactose molecules are 
required with the entropic contribution but 50 would be 
required if there was no entropic contribution. Note there 
are only of order 100 repressors in the cell [||. This would 
imply that the entropic contribution to allostery is sig- 
nificant in controlling the lactose level at which the gene 
expression is turned on. 

Conclusions. We conclude that inducer binding affects 
the Brownian motion within the repressor protein and 
this entropic effect contributes to the allosteric mecha- 
nism in DNA binding proteins alongside static confor- 
mational changes. We can relate the communication of 
the signal across the protein to "design rules" for the 
potentials within it. 

Several extensions of this approach suggest themselves. 
A discrete many-springs model naturally extends to the 
case of a continuous potential between the plates. Sec- 
ondly, the case of multiple, sequential ligand binding will 
lead to additional structure. To make the model even 
more realistic we should also include bending modes of 
the protein monomers themselves. Significantly, addition 
of such bending modes must increase the predicted AAS" 
value if the ligand binding changes the bending rigidity. 
For lac any such increase might give values of AA5 that 
actually dominate the binding free energy. 

We expect this model can be also applied to transmem- 
brane proteins that transmit signals across membranes 
into cells and organelles |U 0] . These systems are sim- 
ilar in that an inducer ligand (e.g. adrenaline) binds to 
the receptor transmembrane protein which in turn al- 
lows it to bind or unbind proteins on the interior of the 
membrane (e.g. to bind to a G-protein in the control 
cycle for glycogen). Within the restricted environment 
of the membrane we expect entropic allostery to play an 
important role in the transmission of the signal through 
the receptor protein. 

The calculated values of the contribution to the free 
energy change from the change in intramolecular vibra- 
tional entropy of the protein easily reach the order of a 
few kT per molecule, within the experimentally observed 
range for these systems. 
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